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'sj" ' We consider Markov basis arising from fractional factorial designs with three- 

■ level factors. Once we have a Markov basis, p values for various conditional tests are 
\ estimated by the Markov chain Monte Carlo procedure. For designed experiments 

with a single count observation for each run, we formulate a generalized linear model 

and consider a sample space with the same sufficient statistics to the observed 

data. Each model is characterized by a covariate matrix, which is constructed 

^ ' from the main and the interaction effects we intend to measure. We investigate 

5— ( ' _ . 

■ fractional factorial designs with 3^ runs noting correspondences to the models for 

3^""^ contingency tables. 
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1 Introduction 

In the past decade, a new application of computational algebraic techniques to statistics 
has been developed rapidly. Diaconis and Sturmfels introduced the notion of Markov 
basis and presented a procedure for sampling from discrete conditional distributions by 
constructing a connected, aperiodic and reversible Markov chain on a given sample space. 
Since then, many works have been published on the topic of the Markov basis by both 
algebraists and statisticians. Contributions of the present authors on Markov bases can 
be found in [2], [3], g], [5], [6], [T], [8], [g, [21] and [22]. On the other hand, series of 
works by Pistone and his collaborators (e.g., [18], [20], [16], [I2] and [T7]) successfully 
applied the theory of Grobner basis to designed experiments. In these works, a design is 
represented as the variety for a set of polynomial equations. 
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In view of these two main areas of algebraic statistics, it is of interest to investigate 
statistical problems which are related to both designed experiments and the Markov basis. 
In [5j we initiated the study of conditional tests of the main effects and the interaction 
effects when count data are observed from a designed experiment. In [5] we investigated 
Markov bases arising from fractional factorial designs with two-level factors. In this paper, 
extending the results in our previous paper, we consider Markov bases for fractional facto- 
rial designs with three-level factors. Motivated by comments by a referee, we also discuss 
relations between the Markov basis approach and the Grobner basis approach to designed 
experiments, although the connection between them are not yet very well developed. In 
considering alias relations for fractional factorial designs, we mainly use a classical nota- 
tion, as explained in standard textbooks on designed experiments such as [23]. We think 
that the classical notation is more accessible to practitioners of experimental designs and 
our proposed method is useful for practical applications. However, mathematically the 
aliasing relations can be more elegantly expressed in the framework of algebraic statistics 
by Pistone et al. We make this connection clear in remarks in Section 2. 

We relate the models for the case of fractional factorial designs to various models of 
contingency tables. In most of the works on Markov bases for contingency tables, the 
models considered are hierarchical models. On the other hand, when we map models for 
fractional factorial designs to models for contingency tables, the resulting models are not 
necessarily hierarchical. Therefore Markov bases for the case of fractional factorial designs 
often have different features than Markov bases for hierarchical models. In particular for 
the fractional factorial designs with three-level factors, we find interesting degree three 
moves and indispensable fibers with three elements. These are of interest also from the 
algebraic viewpoint. 

The construction of this paper is as follows. In Section 2, we illustrate the problem 
of this paper and describe the testing procedure for evaluating p values of the main and 
the interaction effects for controllable factors in designed experiments. Similarly to the 
preceding works on Markov basis for contingency tables, our approach is to construct a 
connected Markov chain for some conditional sample space. We explain how to define 
this sample space corresponding to various null hypotheses. In Section 3, we consider the 
relation between the models for the contingency tables and the models for the designed 
experiments for fractional factorial designs with three-level factors. Then we state prop- 
erties of Markov bases for designs which are practically important. In Section 4, we give 
some discussion. 

2 Markov chain Monte Carlo tests for designed ex- 
periments 

In this section we illustrate the problem of this paper. We consider the Markov chain 
Monte Carlo procedure for conditional tests of the main and the interaction effects of 
controllable factors for the discrete observation derived from various designed experiments. 
Our arguments are based on the theory of the generalized linear models ([15j). 
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Table 1: Design and number of defects y for the wave-solder experiment 
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1 
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37 
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1 
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1 
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52 



2.1 Conditional tests for discrete observations 

Suppose that the observations are counts of some events and one observation is obtained 
for each run of a designed experiment, which is defined by some aliasing relation. (We 
also consider the case that the observations are the ratio of counts in Section 4.) For 
example, Table 1 is a 1/8 fraction of a full factorial design (i.e., a 2^~^ fractional factorial 
design) defined from the aliasing relation 

ABDE = ACDF = BCDG = 1. (1) 

This data set was considered in [5] with some modification from the original data in [H]. 
The original data was reanalyzed in [13]. The observation y in Table 1 is the number of 
defects found in wave-soldering process in attaching components to an electronic circuit 
card. In Chapter 7 of [9], seven factors of a wave-soldering process are considered: (A) 
prebake condition, (B) flux density, (C) conveyer speed, (D) preheat condition, (E) cooling 
time, (F) ultrasonic solder agitator and (G) solder temperature. Each factor of Table 1 
has two-level, which we write or 1 in this paper. The aim of this experiment is to decide 
which levels for each factor are desirable to reduce solder defects. 

Remark 2.1. Specification and notation of aliasing relations in ^ are explained in stan- 
dard textbooks on designed experiments (e.g. 123]) and well understood by practitioners of 
designed experiments. As explained in Section 1.3 and Section 4-6 of ITSf . the aliasing 
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Table 2: Design and observations for a 3^ ^ fractional factorial design 







Factor 
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Run 
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B 


C 


D 




1 














Vi 


2 





1 


1 


2 


1/2 


3 





2 


2 


1 


1/3 


4 


1 





1 


1 


2/4 


5 


1 


1 


2 





2/5 


6 


1 


2 





2 


2/6 


7 


2 





2 


2 


2/7 


8 


2 


1 





1 


2/8 


9 


2 


2 


1 
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relations are more elegantly expressed as a set of polynomials defining an ideal in a poly- 
nomial ring. Consider A, B, . . . , G as indeterminates and let C(A, B, . . . , G) the ring of 
polynomials m A, B, . . . , G with complex coefficients. Then the ideal 

(A^ - 1, B^ - 1, . . . , G^ - 1, ABDE - 1, ACDF - 1, BCDG - 1) (2) 

determines the aliasing relations, i.e., two interaction effects are aliased with each other 
if and only if their difference belongs to the ideal Given a particular term order, the 
set of standard monomials corresponds to a particular choice of saturated model, which 
can he estimated from the experiment. 

Extending the above setting, in this paper, we consider three-level designs with count 
observations. For example. Table 2 shows a 3^~^ fractional factorial design and the ob- 
servations. We write the three levels as {0, 1, 2}. Note that the design in Table 2 is also 
derived from an aliasing relation, 

C = AB, D = AB^. (3) 

We give a more detailed explanation of these aliasing relations in Section 2.2. 

We adopt the theory of the generalized linear models ([15j) as follows. For these types 
of count data, it is natural to consider the Poisson model. Write the observations as 
y = . . . , where k is the number of runs and ?/j's are mutually independently 
distributed with the mean parameter /Xj = E{yi),i = l,...,k. We express the mean 
parameter fii as 

g{fii) = Po + PiXii H h PuXiu, 

where g{-) is the link function and Xn, . . . , Xi^ are the u covariates defined in Section 2.2. 
The sufficient statistic is written as X]i=i^y2/i) J = 1, . . . , z/. The canonical link for the 
Poisson distribution is g{fii) = log/Xj. For later use, we write the //-dimensional parameter 
/3 and the covariate matrix X as 

p = {Po,Pi,...,Pu-iy (4) 
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and 



/ 1 X 
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a^ii^-i 



\ 



X 




(5) 



y 1 Xki ■■ ■ Xku-i J 

where Ik = (1, . . . , 1)' is the fc-dimensional column vector consisting of I's. Then the 
hkehhood function is written as 



which imphes that the sufficient statistic for (3 is X'y = (l^y,x']^y, . . . ,x^_]^y). 

To define a conditional test, we specify the null model and the alternative model in 
terms of the parameter j3. To avoid confusion, we express the free parameters under the 
null model as the //-dimensional parameter (jl]) in this paper. Alternative hypotheses are 
expressed in terms of additional parameters. For example, in the case of various goodness- 
of-fit tests, the alternative model is the saturated model, i.e., f3 is fc-dimensional. Then 
the null and the alternative models are written as 



respectively. On the other hand, if we consider significance test of a single additional effect 
(which can be a main effect or an interaction effect), the alternative model is written in 
the form of 



where m = 1 for the case of two-level factors considered in [5j. For the case of three-level 
factors, m is 2,4,8, . . ., depending on the degree of freedom for the factors we consider. 
We explain this point in Section 2.2. 

Depending on the hypotheses, we also specify appropriate test statistic T(y). For 
example, the likelihood ratio statistic or the Pearson goodness-of-fit statistic are frequently 
used. Once we specify the null model and the test statistic, our purpose is to calculate 
the p value. Similarly to the context of the analysis of the contingency tables, Markov 
chain Monte Carlo procedure is a valuable tool, especially when the traditional large- 
sample approximation is inadequate and the exact calculation of the p value is infeasible. 
To perform the Markov chain Monte Carlo procedure, the key notion is to calculate a 
Markov basis over the sample space 

JF(X'y°) = {y I X'y = X'y°, i/i is a nonnegative integer for z = 1, . . . , k}, (7) 




Ho: (/5„...,A_i) = (0,...,0) 
Hi: (/5„...,/5fc_i)^(0,...,0) 



Hi: (/?„...,/5,+™)^(0,...,0), 



(6) 
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where y° is the observed count vector. Once a Markov basis is calculated, we can construct 
a connected, aperiodic and reversible Markov chain over ([7]). By the Metropolis-Hastings 
procedure, the chain can be modified to have a stationary distribution as the conditional 
distribution under the null model, which is written as 

k 

f{y\X'y = X'y'') = C{X'y")\[- 

i=i y^- 

where C{X'y°) is the normalizing constant determined from X'y° defined as 

y&Tix'y) \i=i y^-' / 

For the definition of Markov basis see [I^ and for computational details of Markov chains 
see [12]. In applications, it is most convenient to rely on algebraic computational softwares 
such as 4ti2 ([!]) to derive a Markov basis. 



2.2 How to define the covariate matrix 

As we have seen in Section 2.1, it is a key formalization to express various models by the 
covariate matrix X. The matrix X is constructed from the design matrix to refiect the 
main and the interaction effects of the factors which we intend to measure. 

For the case of two- level factors, we have already considered this problem in [5]. In 
the case of two-level factors, each main effect and interaction effect can be represented 
as one column of X. This is because each main and interaction effect has one degree of 
freedom in the two- level case. For example of Table 1, the main effect model of the seven 
factors. A, B, C, D, E, F, G can be represented as the 16 x 8 covariate matrix by defining 
Xj G {0, 1}^^ in ([5]) as the levels for the j-th factor given in Table 1. If we intend to 
include, for example, the interaction effect of A x B, the column 

(1,1,1,1,0,0,0,0,0,0,0,0,1,1,1,1)' 

is added to X, which represents the contrast of A x B. It should be noted that Markov 
basis for testing the null hypothesis depends on the model, namely the choice of various 
interaction effects included in X. 

In this paper, we consider the case of three-level designs. To explain three-level frac- 
tional factorial designs, first we consider 3^ full factorial designs. Since a 3^ full factorial 
design is a special case of a multi-way layout, we can use the notions of ANOVA model. 
In this case, each main effect has two degrees of freedom since each factor has three levels. 
Similarly, each two-factor interaction has (3 — 1) (3 — 1) =4 degrees of freedom, three- 
factor interaction has (3 — 1) (3 — 1) (3 — 1) = 8 degrees of freedom and so on. As is noted 
in [23], these sum of squares are further decomposed into components, each with two 
degrees of freedom. Consider, for example, two-factor interaction A x B. We write the 
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levels of the factors A, B, C, . . . as a,b,c . . . G {0, 1, 2} hereafter. Then A x B interaction 
effect is decomposed to two components denoted AB and AB^, where AB represents the 
contrasts satisfying 

a + 6 = 0,1,2 (mod 3), 
and AB^ represents the contrasts satisfying 

a + 26 = 0, 1,2 (mods), (8) 

respectively. Since the contrasts compare values at three levels, 0, 1, 2 (mod 3), each of AB 
and AB^ has two degrees of freedom. We note the contrasts given by ([8]) are equivalent 
to the contrasts given by 

2a + 6 = 0,1,2 (mod 3), 

by relabeling the indices. Following to [23], we use the notational convention that the 
coefficient for the first nonzero factor is 1, to avoid ambiguity. Similarly, ra-factor inter- 
action effects, which have 2^ degrees of freedom, can be decomposed to 2""^ components 
with two degrees of freedom. For example, the three-factor interaction A x B x C is 
decomposed to the 4 components 

ABC,ABC2,AB2C,AB2C2 

and the four-factor interaction AxBxCxDis decomposed to the 8 components 

ABCD, ABCD2, ABC^D, ABC^D^, 
AB^CD, AB^CD^, AB^C^D, AB^C^D^. 

Now we explain how to define the covariate matrix X. For the full factorial designs, X 
is constructed to include the main and the interaction effects with two columns for each 
component of two degrees. For example of 3^ full factorial design, the covariate matrix for 
the main effects model of A, B, C is written as 

111111111111111111111111111 
111111111000000000000000000 
000000000111111111000000000 
X' = 111000000111000000111000000 
000111000000111000000111000 
100100100100100100100100 

10010010010010010 



1 

1 1 1 



(We show the transpose of X to save space hereafter.) Note that the first column repre- 
sents the total mean effect, the second and the third columns represent the contrasts of 
the main effect of A and so on. When we also consider the interaction effect A x B, the 
following four columns are added to X, 



1 1 1 



1 1 1 







1 















1 
1 
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where each pair of columns represents the contrasts of AB and AB^, respectively. For the 
saturated model, there are 27 columns in X, i.e., one column for the total mean effect, 
6(= 2 X 3) columns for the contrasts of the main effects of the factors A, B, C, 12(= 4 x 3) 
columns for the contrasts of the two-factor interaction effects of A x B, A x C, B x C and 
8 columns for the contrasts of the three-factor interaction effect of A x B x C. 

Next we consider the fractional factorial designs. Recall a 3^~^ fractional factorial 
design in Table 2 of Section 1. In this design, since each main effect has two degrees 
of freedom, the model of the main effects for all factors. A, B, C, D, is nothing but the 
saturated model. To consider the models with interaction effects, we consider the designs 
of 27 runs. For example, 3'^"^ fractional factorial design of resolution IV is defined by the 
aliasing relation D = ABC. The relation D = ABC means that the level d of the factor 
D is determined by the relation 

d = a + h + c (mod 3), 

which can also be equivalently written as 

a + 6 + c + 2ci = 0, 1, 2 (mod 3). 

Therefore this aliasing relation is also written as ABCD^ = I. By the similar modulus 3 
calculus, we can derive all the aliasing relations as follows. 



I = 


ABCD^ 








A = 


: BCD2 = 


AB^C^D 


B = 


ACD2 = AB^CD^ 


C = 


: ABD2 = 


ABC^D^ 


D = 


ABC = ABCD 


AB 


= CD2 = 


ABC^D 


AB2 


= AC^D = BC^D 


AC 


= BD2 = 


AB^CD 


AC2 


= AB^D = BC^D^ 


AD 


= AB^C^ 


= BCD 


AD2 


= BC = AB^C^D^ 


BC^ 


= = AB^D 


2 = AC^D^ 


BD : 


= AB^C = ACD 


CD 


= ABC^ : 


= ABD 







(9) 



From ([9]), we can clarify the models where all the effects are estimable. For example, the 
model of the main effects for the factors A, B, C, D and the interaction effects A x B is 
estimable, since the two components of A x B, AB and AB^, are not confounded to any 
main effect. Among the model of the main effects and two two-factor interaction effects, 
the model with A x B and A x C is estimable, while the model with A x B and C x D 
is not estimable since the components AB and CD^ are confounded. In |23j, main effects 
or components of two-factor interaction effects are called clear if they are not confounded 
to any other main effects or components of two-factor interaction effects. Moreover, a 
two-factor interaction effect, say A x B is called clear ii both of its components, AB and 
AB^, are clear. Therefore ([H]) implies that each of the main effect and the components, 
AB^, AC^, AD, BC^, BD, CD are clear, while there is no clear two-factor interaction effect. 

Remark 2.2. As in Remark \2.1\ the aliasing relations in ^ can he more elegantly de- 
scribed in the framework of ITSf - We consider the polynomial ring C(A, B, C, D) in in- 
determinates A,B, C,D. An important note here is that, when we consider polynomials 
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in C(A, B, C,D), we cannot treat two monomials as the same even if they designate the 
same contrast by relabeling indices (and hence we cannot use the notational convention of 
JE^). Therefore the aliasing relations ^ have to be more fully written as 

I = ABCD2, 

A = B^C^D = A^BCD^, A^ = BCD^ = AB^C^D, 

: (10) 

AB = C^D = A^B^CD^, AB^ = A^CD^ = BC^D, 
A^B = AC^D = B^CD^, A^B^ = CD^ = ABC^D . 

For example A = BCD^ = AB^C^D in where A and A^ are identified as the same 
contrast, is split into two aliasing relations A = B^C^D = A^BCD^, A^ = BCD^ = 
AB^C^D, where A and A^ are treated as different monomials. 
We first consider the polynomials 

A^ - 1, B^ - 1, - 1, - 1. (11) 

Note that the roots of = 1 are 1,00,00"^, where u = cos(27r/3) + i 8111(271/3) is the cube 
root of unity. Therefore corresponds to labeling the three levels of the factors A, . . . , D 
as 1,00 or uj"^ . In the case of two-level factors, this corresponds to labeling levels as +1 
and —1 (rather than and 1). Then the ideal 

(A^-1,B3-1,C^-1,D3-1,D-ABC) (12) 

determines the aliasing relations, i.e., two interaction effects are aliased in the sense of 
( fi^) if and only if their difference belongs to [T^) . For example, A and B^C^D are aliased 
since 

A - B^C^D 

= (B^C^D - A)(A3 - 1) - A4C3(B3 - 1) - A\C^ - 1) - A3b2C2(D - ABC) 
e (A3 - 1, B^ - 1, - 1, D3 - 1, D - ABC) . 

Note that in Example 29 of fTSf . three levels of a factor are coded as {—1,0, 1} and 
the polynomials A^ — A, . . . , — D are used for determining the design ideal. These 
polynomials and coding by { — 1,0, 1} do not correspond to factorial designs considered in 
this section. 

3 Correspondence to the models for contingency ta- 
bles 

In this section, we investigate relation between fractional factorial designs with 3^~^ runs 
and contingency tables. Since Markov bases have been mainly considered in the context 
of contingency tables, it is convenient to characterize the relations from the viewpoint of 
hierarchical models of contingency tables. For the 2^~^ fractional factorial designs, we have 
considered this topic in [5j. In this paper, we show that many interesting indispensable 
fibers with three elements appear from the three-level designs. 
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3.1 Models for the full factorial designs 



First we consider 3^ full factorial design and prepare a fundamental fact. Our idea is to 
index observations as y = iyi^-ip)-, 1 < • • • , V — 3; instead of y = . . . , Uk)' ■, k = 3^, 
to investigate the correspondence to the 3^ contingency table. In the case of the 3^ full 
factorial design, for example, the contrasts for each factor and the observation are written 
as follows. 



In this case, we see that the sufficient statistic for the parameter for the total mean is 
expressed as y.. and, under given y.., the sufficient statistic for the parameter of the main 
effects of the factors A and B are expressed as yi. and y.j, respectively. Moreover, it is 
seen that adding contrasts for AB and AB^ yields the saturated model. Note that this 
relation also holds for higher dimensional contingency tables, which we summarize in the 
following. We write the controllable factors as Ai, A2, A3, . . . instead of A, B, C . . . here. 
We also use the notation of D-marginal in the p-dimensional contingency tables for D C 
{1, ■ ■ ■ ,p} here. For example, {l}-marginal, {2}-marginal, {3}-marginal of y = {yijk) are 
the one-dimensional tables {yi-.}, {y-j-}, {V- k}, respectively, and {1, 2}-marginal, {1,3}- 
marginal, {2, 3}-marginal of y = (yijk) are the two-dimensional tables {yij-}, {Vi-k}, {V-jk}, 
respectively. See |Tl] for the formal definition. 

Fact 3.1. For 3^ full factorial design, write observations as y = {yii---ip)- Then the 
necessary and the sufficient condition that the {ii, . . . ,in}-niarginal n- dimensional table 
{n < p) is uniquely determined from X'y is that the covariate matrix X includes the 
contrasts for all the components of m-f actor interaction effects A^^ x Aj,^ x ■ ■ ■ x Aj^ for 
all {ji, • • • , jm} C {ii, . . .,in},m < n. 

This fact is easily proved as follows. The saturated model for the 3^^ full factorial 
design is expressed as the contrast for the total mean, 2xn contrasts for the main effects, 
2™ X contrasts for the m-factor interaction effects for m = 2, . . . , tt,, since they are 
linearly independent and 



Run A B AB AB^ 



y 



1000 

2011 2 

3 2 2 1 

4 10 1 1 

5 112 

6 12 2 

7 2 2 2 

8 2 10 1 

9 2 2 1 



yn 

yi2 

yn 

1/21 
1/22 
1/23 
1/31 
1/32 
1/33 
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3.2 Models for the fractional factorial designs 

Fact 3.1 states that hierarchical models for the controllable factors in the 3^ full factorial 
design corresponds to the hierarchical models for the 3^ contingency table completely. 
On the other hand, hierarchical models for the controllable factors in the 3^"'' fractional 
factorial design do not correspond to the hierarchical models for the 3^ contingency table 
in general. This is because X contains only part of the contrasts of interaction elements in 
the case of fractional factorial designs. Consequently, many interesting structures appear 
in considering the Markov basis for the fractional factorial designs. 

As a simplest example, we first consider a design with 9 runs for three controllable 
factors, i.e., 3^~^ fractional factorial design. Write three controllable factors as A, B,C, 
and define C = AB. The design is represented as Table 2 by ignoring the factor D. In 
this design, the covariate matrix for the main effects model of A, B, C is defined as 

"111111111" 
1 1 1 
1 1 1 
X' ^ 100100100 

1 1 1 

1 1 1 
1 1 1 

To investigate the structure of the fiber, write the observation as a frequency of the 3x3 
contingency table, yn, . . . , 1/33. Then the fiber is the set of tables with the same row sums 
{Ui.}, column sums {y.j} and the contrast displayed as 






1 


2 


1 


2 





2 





1 



To construct a minimal Markov basis, we see that the moves to connect the following 
three-elements fiber are sufficient 



1 











1 











1 






1 











1 


1 















1 


1 











1 






Therefore any two moves from the set 



+1 


-1 





+1 -1 


-1 


+1 



+1 


-1 


-1 


+ 1 





-1 +1 






+1 -1 


-1 


+1 


+1 


-1 



is a minimal Markov basis. In the following, to save the space, we use a binomial repre- 
sentation. For example, the above three moves are written as 



yiiy22y33 - yi2y23y3i, ^112/222/33 - 2/132/212/32, 2/122/232/31 - 2/132/212/32- 
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In this paper, we consider three types of fractional factorial designs with 27 runs, 
which are important for practical applications. We investigate the relations between 
various models for the fractional factorial designs and the 3x3x3 contingency table. In 
the context of the Markov basis for the contingency tables, Markov basis for the 3x3x3 
contingency tables have been investigated by many researchers, especially for the no 
three-factor interaction model by [3]. In the following, we investigate Markov bases for 
some models, especially we are concerned about their minimality, unique minimality and 
indispensability of their elements. These concepts are presented in [21j and [8j. In this 
paper, we define that a Markov basis is minimal if no proper subset of it is a Markov basis. 
A minimal Markov basis is unique if there is only one minimal Markov basis except for 
sign changes of their elements. An element of a Markov basis is represented as a binomial. 
We call it a move following our previous papers. A move z is indispensable if z or — z 
belongs to every Markov basis. 

3^y^ fractional factorial design defined from D = ABC In the case of four control- 
lable factors for design with 27 runs, we have a resolution IV design by setting D = ABC. 
As seen in Section 2.2, all main effects are clear, whereas all the two-factor interactions 
are not clear in this design. 

For the main effect model in this design, the sufficient statistic is written as 

{Vt-}^ {y-j }^ {y-k} 

and the contrasts of ABC, 

yiU + yi23 + 2/132 + 2/213 + 2/222 + 2/231 + 2/312 + 2/321 + 2/333, 
2/112 + 2/121 + 2/133 + 2/211 + 2/223 + 2/232 + 2/313 + 2/322 + 2/331, 
2/113 + 2/122 + 2/131 + 2/212 + 2/221 + 2/233 + 2/311 + 2/323 + 2/332- 

By calculation by 4ti2, we see that the minimal Markov basis for this model consists of 
54 degree 2 moves and 24 degree 3 moves. All the elements of the same degrees are on 
the same orbit (see [7], [6]). The elements of degree 2 connect three-elements fibers such 
as 

{2/1122/221, 2/1212/212, 2/1222/211} (13) 
into a tree, and the elements of degree 3 connect three-elements fibers such as 

{2/1112/1222/133, 2/1122/1232/131, 2/1132/1212/132} (14) 
into a tree. For the fiber f[T^ . for example, two moves such as 

2/1212/212 - 2/1122/221, 2/1222/211 - 2/1122/221 

are needed for a Markov basis. See ^21j for detail on the structure of a minimal Markov 
basis. 

Considering the aliasing relations given by ([9]), we can consider models with interaction 
effects. We see by performing 4ti2 that the structures of the minimal Markov bases for 
each model are given as follows. 
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• For the model of the main effects and the interaction effect A x B, 27 indispensable 
moves of degree 2 such as ?/ii3?/32i — l/iiiZ/323 and 54 dispensable moves of degree 
3 constitute a minimal Markov basis. The degree 3 elements are on two orbits, 
one connects 9 three-elements fibers such as f lT^ and the other connects 18 three- 
elements fibers such as {?/iii?/i33y2i2, 2/112^1312/213, 2/1132/1322/211}- 

• For the model of the main effects and the interaction effects A x B, A x C, 6 dis- 
pensable moves of degree 3, 81 indispensable moves of degree 4 such as 

2/1122/1212/2132/221 - 2/1112/1222/2112/223 

and 171 indispensable moves of degree 6, 63 moves such as 

2/1122/1212/1332/2132/2222/231 - 2/lll2/l232/l322/21l2/2232/232 

and 108 moves such as 

2/ll22/l2l2/2132/23l2/31l2/323 - 2/lll2/l222/21l2/2332/3132/321 , 

constitute a minimal Markov basis. The degree 3 elements connect three-elements 
fibers such as (fT^ . 

• For the model of the main effects and the interaction effects A x B, A x C, B x C, 
27 indispensable moves of degree 6 such as 

2/1132/1212/1322/2112/2222/233 " 2/lll2/l222/l332/2132/22l2/232 

and 27 indispensable moves of degree 8 such as 

2/lll2/l222/l332/2122/22l2/3132/331 " 2/ll22/ll32/l2l2/l3l2/21l2/2222/31l2/333 

constitute a unique minimal Markov basis. 

• For the model of the main effect and the interaction effects A x B,A x C,A x D, 
6 dispensable moves of degree 3 constitute a minimal Markov basis, which connect 
three-elements fibers such as (fl^ . 

3j7/ fractional factorial design defined from D = AB, E = AB^C In the case of 

five controllable factors for designs with 27 runs, the contrasts for the two main factors 
are allocated by two aliasing relations. 

In this paper, we consider two designs from Table 5A.2 of [23]. First we consider the 
3j7/ fractional factorial design defined from D = AB, E = AB^C. 

For this design, we can consider the following nine distinct hierarchical models (except 
for the saturated model). Minimal Markov bases for these models are calculated by 4ti2 
as follows. 
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For the model of the main effects of A, B, C, D, E, 27 indispensable moves of degree 2 
such as |/ii2y22i — 2/iiiy2225 56 dispensable moves of degree 3, 54 indispensable moves 
of degree 4 such as 

2/112^/121^2312/312 - 2/1112/1312/2122/322 

and 9 indispensable moves of degree 6 such as 

2/1222/1312/2112/2322/3122/321 " 2/lll2/ll22/2222/321 

constitute a minimal Markov basis. The degree 3 elements are in 3 orbits, which 
connects three types of three-elements fibers, i.e., 

18 moves for 9 fibers such as {2/1112/1232/132, 2/1132/1222/131, 2/1122/1212/133}, 
36 moves for 18 fibers such as {2/1112/1232/212, 2/1132/1222/211, 2/1122/1212/213} and 
2 moves for the fiber {2/1122/2232/331, 2/1312/2122/323, 2/1212/2322/313}- 

For the model of the main effects and the interaction effect A x C, 18 dispensable 
moves of degree 3, 162 indispensable moves of degree 4 such as 

2/1122/1212/2132/221 - 2/1112/1222/2112/223, 

135 indispensable moves of degree 5 such as 

81 moves of the form 2/ii22/ii32/i2i2/22i2/33i - 2/iii2/i222/i232/23i2/3ii and 
54 moves of the form 2/n22/i2i2/22i2/33i - 2/iii2/i222/i322/2ii2/32i 

and 54 indispensable moves of degree 6 such as 

2/1112/1222/1332/2112/2222/233 " 2/ll22/l232/l3l2/2132/22l2/232 (15) 

constitute a minimal Markov basis. The degree 3 elements connect three-elements 
fibers such as 

{2/1112/1232/132, 2/1122/1212/133, 2/1132/1222/131 } (16) 

For the model of the main effects and the interaction effect C x E, 27 indispensable 
moves of degree 2 such as 2/1122/221 — 2/1112/222 constitute a unique minimal Markov 
basis. 

For the model of the main effects and the interaction effects A x C, A x E, 6 dis- 
pensable moves of degree 3 and 81 indispensable moves of degree 6 such as 

2/1222/1312/2112/2322/3122/321 " 2/lll2/ll22/22l2/2222/33l2/332 

constitute a minimal Markov basis. The degree 3 elements connect three-elements 
fibers such as (fT6|) . 



14 



• For the model of the main effects and the interaction effects A x C, B x C, 27 
indispensable moves of degree 4 such as 

1/112^121^/2322/311 - 2/1112/1322/2122/321 

and 54 indispensable moves of degree 6 such as 

2/1122/1212/1332/2112/2232/232 - 2/lll2/l232/l322/2122/22l2/233 

constitute a unique minimal Markov basis. 

• For the model of the main effects and the interaction effects A x C, C x E, 27 
indispensable moves of degree 4 such as 

2/1112/1322/2112/222 - 2/1122/1312/2122/221 

and 54 indispensable moves of degree 6 such as (fT5|) constitute a unique minimal 
Markov basis. 

• For the model of the main effects and the interaction effects A x C, A x E, C x E, 9 
indispensable moves of degree 6 such as 

2/1132/1222/1312/2122/2212/233 - 2/lll2/l232/l322/21l2/2232/232 

constitute a unique minimal Markov basis. 

• For the model of the main effects and the interaction effects A x C, B x C, C x D, 
9 indispensable moves of degree 6 such as 

2/1222/1312/2112/2322/3122/321 - 2/lll2/ll22/22l2/2222/33l2/332 

constitute a unique minimal Markov basis. 

• For the model of the main effects and the interaction effects A x C, B x C, C x E, 9 
indispensable moves of degree 6 such as 

2/1122/1212/2112/2322/3222/331 - 2/lll2/l222/2122/23l2/32l2/332 

constitute a unique minimal Markov basis. 

3j7/ fractional factorial design defined from D = AB, E = AB^ Next we consider 
3^7/^ fractional factorial design defined from D = AB,E = AB^. For this design, we can 
consider the following four distinct hierarchical models (except for the saturated model). 
Minimal Markov bases for these models are calculated by 4ti2 as follows. 

• For the model of the main effects of A, B, C, D, E, 108 indispensable moves of degree 
2 such as 2/1122/121 — 2/iii2/i22 constitute a unique minimal Markov basis. 
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• For the model of the main effects and the interaction effect A x C, 27 indispensable 
moves of degree 2 such as 2/1122/121 — 2/1112/122 constitute a unique minimal Markov 
basis. 

• For the model of the main effects and the interaction effects A x C, B x C, 27 
indispensable moves of degree 4 such as 

2/1122/1212/2112/222 - 2/1112/1222/2122/221 
and 54 indispensable moves of degree 6 such as 

2/1122/1212/1332/2112/2232/232 - 2/lll2/l232/l322/2122/22l2/233 

constitute a unique minimal Markov basis. 

• For the model of the main effects and the interaction effects A x C, B x C, C x D, 
9 indispensable moves of degree 6 such as 

2/lll2/l322/2122/22l2/3222/331 ^ 2/ll22/l3l2/21l2/2222/32l2/332 

constitute a unique minimal Markov basis. 

4 Discussion 

In this paper, we investigate a Markov basis arising from the fractional factorial designs 
with three-level factors. As noted in Section 1, the notion of a Markov basis is one of 
the fundamental key words in the first work of the computational algebraic statistics. 
Moreover, the designed experiment is also one of the areas in statistics where the theory 
of the Grobner basis found applications. Since we give another application of the theory 
of the Grobner basis to the designed experiments, this paper relates to both of the works 
by Diaconis and Sturmfels ([10]) and Pistone and Wynn ((18]). 

Though we suppose that the observations are counts in Section 2, our arguments can 
also be applied to the case that the observations are the ratio of counts. In this case, we 
consider the logistic link function instead of the logit link, and investigate the relation 
between 3'^"'' fractional factorial designs to the S^"^"*"^ contingency tables. See [5] for the 
two- level case. 

One of the interesting observations of this paper is that many three-elements fibers 
arise in considering minimal Markov bases. In fact, in the examples considered in Section 
3.2, all the dispensable moves of minimal Markov bases are needed for connecting three- 
elements fibers, where each element of the fibers does not share supports in each other. 
This shows that the every positive and the negative part of the dispensable moves is a 
indispensable. See notion of the indispensable monomial in [8\. 

It is of great interest to clarify relationships between our approach and the works by 
Pistone, Riccomagno and Wynn [TB]. In [L6\, designs are defined as the set of points (i.e.. 



16 



the affine variety), and the set of polynomials vanishing at these points (i.e., the design 
ideal) are considered. They calculate the Grobner basis of the design ideal, which is used 
to specify the identifiable models or confounding relations. In Section 2 we explained that 
the aliasing relations for fractional factorial designs specified in the classical notation can 
be more elegantly described in the framework of [TB]. It is important to study whether a 
closer connection can be established between a design ideal and the Markov basis (toric 
ideal). It should be noted, however that a Markov basis depends on the covariate matrix 
X, which incorporates the statistical model we aim to test, whereas the Grobner basis 
depends only on the design points under a given term order. 

Finally as suggested by a referee, it may be valuable to consider relations between the 
arguments of this paper and designs other than fractional factorial designs, such as the 
Plackett-Burman designs or the balanced incomplete block designs. These topics are left 
to future works. 
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